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Abstract. This is an informal guide to the history of Heisenberg's matrix 
mechanics. It is designed for mathematicians with only a minimal background 
in either physics or geometry, and it is based upon Heisenberg's original argu- 
ments. 



1. Introduction 

The most dramatic shift in Twentieth Century physics stemmed from Heisen- 
berg's formulation of matrix mechanics [§]. In classical physics, quantities such as 
position, momentum, and energy are regarded as functions. In quantum theory 
one replaces the functions by non-commuting infinite matrices or to be more pre- 
cise, self- adjoint operators on Hilbert spaces. This enigmatic step remains the most 
daunting obstacle for those who wish to understand the subject. 

Although there exist many excellent mathematical introductions to quantum me- 
chanics (see, e.g., ^21, El) they are understandably focused on the development of 
mathematically coherent methods. As a result, mathematics students must post- 
pone understanding why non-commuting variables appeared in the first place. To 
remedy this, one can adopt a more historical approach, such as that found in Emch's 
beautiful historical monograph [o], the entertaining yet informative "comic book" 
[TT] , or Born's classic text [2] . 

In recounting the creation of quantum mechanics, the most difficult task is to 
describe how Heisenberg found the canonical commutation relation 

(1) PQ-QP=^I 

for the position and momentum operators Q and P. This equation is the final 
refinement of Planck's principle that a certain action variable is discrete, or more 
precisely that it can assume only integer increments of a universal constant h. 
Heisenberg used the more sophisticated formulation of Bohr and Sommerfield that 
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for periodic systems one has the "quantum condition" 

(2) <j> pdq = nh 
(see §4). 

In the words of Emch (see [Sj, p. 262) "one can only propose some very loose 
a priori justifications" for the derivation of (JTJ from J2J). Even Born, who was 
apparently the first to postulate the general form of JU (see [Hj, p. 264), avoided 
discussing it, appealing instead to the Schrodinger model (0, p. 130, see also 
p. 224), and this is the approach that one finds in most physics texts. We will 
attempt to make Heisenberg's direct conceptual leap a little less mysterious, by 
deciphering an argument that Heisenberg presented in his 1930 survey ^UJ- At the 
heart of his computation is the observation that 

the analogue of the derivative for the discrete action variable is just 
the corresponding finite difference quotient. 

(see (Pt ). 

Shortly after Heisenberg introduced matrix mechanics, Schrodinger found an 
alternative quantum theory based upon the study of certain wave equations |16j . 
His approach enabled one to avoid a direct reference to Heisenberg's matrices. 
Although it is both intuitive and computationally powerful, "wave mechancs" is not 
as useful in quantum field theory. The difficulty is that it does not fully accomodate 
the particle aspects of quanta. In quantum field theory one must take into account 
the incessant creation and annihilation of particles associated with the relativistic 
equivalence of mass and energy. In particular, the number of particles present must 
itself be regarded as an integer valued quantum variable. In Born's words 0, p. 
130, "Heisenberg's method turns out to be more fundamental." 

Our goal has been to maximize the accessiblity of the material. In order to do 
this we have taken liberties with the mathematical, physical, and historical details. 
To some extent this is justified by the fact that regardless of how much care we 
might take, the discussion is necessarily tentative. Although Heisenberg's argument 
is mathematically quite suggestive, in the end we must discard these notions in favor 
of the operator techniques that grew out of them. 

2. Atomic spectra, Fourier series and matrices 

The crisis that occured in classical physics is clearly seen in the peculiar prop- 
erties of atomic spectra. If one sends an electric discharge through an elemental 
gas A such as hydrogen or sodium, the gas will emit light composed of very precise 
(angular) frequencies u>. The corresponding spectrum spA of such frequencies is 
quite specific to the element A. For a single frequency we have the corresponding 
representation 

cos(wi + a) = W (wt+o) = C-ie"^ + c + c ie luJt 

for suitable complex constants c_i, Co, c\. Superimposing these frequencies, we may 
describe the radiation by the sum 

(3) f A (t)= J2 c " e ^' 

u£sp A 

where sp A = spA U — spA U {0}. 
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There are obvious classical analogues of this phenomenon. If one strikes an 
object, the resulting sound can be decomposed into certain specific angular fre- 
quencies. In the case of a tuning fork, the resulting motion is harmonic, and one 
obtains a corresponding Fourier series for the amplitude of the sound wave in the 
form 

f(t) =Acos(ujt + a) = c-ie - *"* + c + c x e wt . 
for suitable complex coefficients Ck- If one instead plucks a guitar string, the result- 
ing sound is a combination of various frequencies, all of which are overtones, i.e., 
multiples of a fundamental frequency uj. Thus one has a Fourier series 

(4) f{t) = Y J c ^ l{noj)t - 

where for simplicity we assume that only finitely many of the c„ are non-zero. We 
define the (full) spectrum of / to be the cyclic group Zlu — {ncu : n e Z}. As is 
well-known, one can duplicate the sound of a guitar string by superimposing the 
pure frequencies as in 10} . 

More complicated systems (such as a bell) will have several degrees of freedom, 
and thus several fundamental frequencies. For a system with two degrees of free- 
dom one will have two fundamental frequencies w, u>' with the "almost periodic" 
expansions 

/(<)= Cn,n'e i(n *' + " V) *. 

We will restrict our attention to one degree of freedom. 

The linear space A(lu) of all functions of the form Q with finitely many non-zero 
terms is closed under multiplication since if we are given 

neZ 

g(t) = J2 d ^ l(nUJ)t 

neZ 

then 

/(*)<?(*) = J2 Ckdn-ke^+^-^t =Y J *ne i{nw)t , 

k : n€ L 'L nGZ 

where a n is the "convolution" 

(5) a n = (c* d) n = ^ c kdn-k, 
Furthermore A(u>) is closed under conjugation since 

where c* = c_„. In more technical terms, the * -algebra A(uj) is a representation of 
the group *-algebra C[Z]. This result, of course, stems from the fact that sp/ = Zui 
is a group under addition. 

Returning to atomic spectra, it is tempting to regard © as some kind of Fourier 
series. There are several problems with this interpretation. 

First of all, we are actually interested in analyzing the property of a single atom. 
In this case it is inappropriate to actually "add up" the series ©■ For example 
(getting a little ahead of ourselves) a hydrogen atom will radiate only one frequency 
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at a time corresponding to the electron taking a particular orbital jump. Thus 
superpositions do not occur when one "watches" a single atom. For this reason it 
is more accurate to let f(t) stand for the array {c UJ e lut ) u] £ SPo A- 

Secondly, in striking contrast to the classical models, it is not useful to consider 
the additive group generated by sp A Given lo S sp A, one cannot expect to 
find any of the overtones mo in sp A Nevertheless the set sp A does display an 
exquisitely precise algebraic structure, called the Ritz combination principle. We 
may doubly index spA, i.e., we may let sp A = {io m _ n } m n6N , in such a manner 
that 



(0) 



for all m,n,p G N. In particular, w m . m + tOm.m — ^m,m and thus u;„ 
Furthermore, to m .n + ^n.m = ^m,m = 0, and therefore to n ,m - 
double indexing of the spectrum, our array becomes a matrix: 

(7) f(t) = | 



Using this 



l e Jm,n£F 



The set M{lo) of matrices (JTJl is already a linear space. Owing to ©, M(uj) is 
closed under matrix multiplication and the adjoint operation since 



f(t)g(t) 



k 

^ ^ C7n,kdk,n& ^ ' S ' 
k 



where a = cd is the usual matrix product, and 

/(*)* = [on.me-*--- 1 ] = [<„e— *] 

with a* the adjoint matrix. In fact one can regard M(uj) as a representation of the 
*-algebra C[N x N] of the full groupoid NxN. This point of view has been explored 
by Connes [3], but we will not pursue it further in this paper. 

It is easy to prove that any doubly indexed family w m! „ satisfying 10 must have 
the form 

for suitable constants C m . The precise values for the hydrogen atom are given by 
Balmer's equation 



(8) 



c c 

W m „ — 2-kR 7T — 

m n z 



where c is the speed of light, and R is known as the Rydberg's constant. 

Long before matrices were introduced, Bohr justified Rydberg's equation by 
combining Rutherford's model of the atom with a quantum condition on the action 
variable. This "old" quantum theory was to play a crucial role in the evolution of 
matrix mechanics. 



MATRIX REVOLUTIONS 



3. Action and quantization conditions 



Action is perhaps the least intuitive of the standard notions of classical mechan- 
ics. As usual, the easiest way to understand a physical quantity is to consider its 
units or "dimensions". We let M, C, and T denote units of mass m, length (or 
position) q, and time t (e.g. one can use grams, meters, and seconds). Given a 
physical quantity P, we let [P] denote its units. We have, for example, 



[velocity v] = 

[acceleration a] = 

[momentum p] = 

[force F ] = 

[potential energy V] = 

[kinetic energy T] = 

[total energy H] = 



dq 
dt 
d 2 c 



dt 2 



= CT~ 



= LT~ 



\mv 



MCT~ 
[ma] = MCT- 2 
[-Fq] = MC 2 T 
1 



—mv 
2 



[V + T] = MC 2 T~ 



Noting that they have the same dimensions, we simply regard V, T, and E — V + T 
as "different forms" of energy. We will often consider derivative and integral versions 
of these quantities, such as v and a above and the potential energy 



V 



= - J F(q)dq. 



The dimensions frequently mirror physical laws. For example the equation for force 
corresponds to Newton's second law. On the other hand the relativistic equation 
E = mc 2 corresponds to MC 2 T- 2 = M x (CT^ 1 ) 2 . 

The usual form of a travelling wave (in one spatial dimension) is given by 



(9) 



f(t, q) = Acos(uit + kq) 



where ui is the angular frequency (radians per second) and k is angular wavenumbcr 
(radians per meter). The corresponding dimensions are 



[angular frequency uj) — [radians] / [time] = T 
[angular wavenumber k] — [radians] / [distance] = C^ 1 

We recall these are related to the frequency v (cycles per second) and wavelength 
(of a cycle) A by lo = and k — 2ir/\l. 

Given an angular co-ordinate 9 measured in radians, we have the dimension 

[angular velocity uj] = 

By analogy with the momentum formula p — mv, the angular momentum is defined 
by L = iu>, where i is the "moment of inertia" , or cquivalcntly L is the signed length 
of the vector L = r x p, where r is the position vector and p is the momentum 
vector. Thus we have 

[angular momentum L] = MC 2 1 1 . 



dB 
~dl 
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In classical physics, the (restricted) action along a parametrized curve 7 is defined 
by the formulas 



J[ 7 ] = [ pdq= [ Tdt, 

J 'v J a 



'7 

and the actual motion taken by the particle is determined by finding the station- 
ary values of suitable variations of J with fixed energy (alternatively one can use 
a different variational principle involving the Lagrangian, see 0, The corre- 
sponding dimensions are given by 

[action J] = [energy] x [time] = [momentum] x [distance] = A4C 2 T~ 1 . 

We see from above that action has the same dimensions as angular momentum. 
Following w e will also use the action / = (l/27r)J. 

Quantum mechanics began in 1900 with Planck's paper He discovered that 
he could predict the radiation properties of black bodies provided he assumed a 
"quantum condition" . He essentially postulated that the action variable J asso- 
ciated with an atom can take only the discrete values nh, where h is a universal 
constant and n € N. 

An early task of quantum mechanics was to reconcile the particle and wave 
properties of "quantum objects" such as photons and electrons. Einstein related 
the energy E and momentum p of a photon to the frequency v and the wavelength 
A of the corresponding wave. Noting that E/v and pX are action variables (see 
above), he predicted that each of these equals the "minimal action" h, i. e., we 
have the Einstein relations 

E = hv = hio 
p = h/X = hk 

where h = h/2-K. Fifteen years later de Broglie 0] proposed that these relations 
were valid for all particles exhibiting the wave-particle dichotomy, including the 
electron. It was a short step from there to finding a wave equation for which 
the corresponding functions @ are solutions. This is precisely the Schrodinger 
equation. 

In 1913 Bohr used the Planck-Einstein quantum condition to explain the spec- 
tral lines of the hydrogen atom pQ. He proposed that the electron is constrained 
to particular circular orbits by the quantum condition. To be more specific, he 
assumed that the electron had a specific energy E m in the m-th orbit, and that it 
drops down (respectively jumps up) to the n-th orbit, it loses (respectively absorbs) 
energy E m — E n , which is carried away or brought by a photon with frequency 

E — E 

(10) U m , n = . 

When Bohr used the classical Coulomb law to calculate the angular momentum L 
of the electron in the m-th orbit, he discovered that it was given by L — mh for an 
integer m. In fact by using the Hamiltonian theory from the next section, he and 
Sommerfield showed that this coincides with Planck's quantum condition J = mh, 
and the latter is also true for arbitrary closed orbital motions. Within a few years, 
Bohr's theory was used to predict the frequencies of the spectral lines for a variety 
of systems. 

Bohr also formulated a fundamental asymptotic property for the spectral values 
which he called the correspondence principle. Returning to the Rydberg formula, 



MATRIX REVOLUTIONS 



7 



he observed that for large to, the electrons behaved almost classically, in the sense 
that one obtained overtones. More precisely, a drop offc = m — tt,<Cto orbits 
resulted in the fc-th overtone of a fundamental frequency io m — 2-KRcjvr? : 



'^rn.m — k 



2ttRc(- 



= 2irk 



ku> r . 



1 

m? 
Rc 

3 



1 



(m - 
(1 



kf' 
- k/2m) 



m 3 J (1 — 2k/m + k 2 /m 2 ) 



A similar principle applies if k is negative. We will use the notation k <C n to 
indicate relatively small positive or negative jumps. 

In principle it would seem that we might have to consider infinitely many fun- 
damental frequencies oj m . However despite its nebulous character, Bohr used the 
correspondence principle to very accurately predict the value of the Rydberg con- 
stant R as well as the "radius" of a hydrogen atom. 

Bohr's "old" quantum theory suffered from a number of defects. In particular, 
the increasingly technical quantum conditions seemed unnatural, and it was difficult 
to calculate the "Fourier coefficients a m ^ n " . The quantity |a m .„| 2 measures the 
intensity of the frequencies <jj myn , or at the level of a single atom, to the probability 
that a jump from to to n might occur. Just as one cannot "in principle" predict 
when a radioactive atom might decay, one can cannot say when an electron will 
"jump". This is a prototypical example of the probablilisic nature of quantum 
mechanics. 

Heisenberg concluded that the weakness of Bohr's theory rested upon the fact 
that it was concerned with predicting the hypothetical singly indexed energies E n 
rather than the actually observed doubly indexed frequencies tJ m ,n- As we have seen 
above, it was this perspective that led him to consider matrices, . However to carry 
out his program he had to incorporate the quantum conditions into his framework. 



4. Phase space and action angle variables 

Quantization is typically applied to algebras of functions. Since the Hamilton- 
ian approach to mechanics is concerned with an algebra of functions on a suit- 
able parameter space, it is ideally suited for this process. What is particularly 
useful about the Hamiltonian formulation is that each function determines a one- 
parameter group of automorphisms, and in particular, the energy function deter- 
mines the physical evolution of the system. Let us summarize this theory as quickly 
as possible. 

Let us first suppose that we are given a parameter space M = M. n . We let V(M) 
be the algebra of infinitely diffcrcntiable functions on M and T(M) — My. K" be the 
coresponding tangent space. We recall that we regard (x, v) e TM as a "tangent 
vector at x" , and that it determines a corresponding directional derivative. Given 
x E M and v = v j e j e we define 

D (XtV) : V(M) -R : / ^^(a:). 
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Since tangent vectors are only used to indicate the directonal derivatives that they 
define, we use the notation 

A vector field is a mapping 

F : M -» T(M) : x h-> £ T X (M), 

and we may write 

Given / G T>(M), we have .Dp : a; i— > Dpi x \f is again a smooth function on M, and 
the mapping 

D = Dp : V(M) -> P(M) 

is a derivation of the algebra V(M), i.e, we have 

D(fg) = D(f)g + fD(g). 

As is well known, all derivations of V(M) arise in this manner (see |18p. 
A curve 

x : (a, b) -»■ M : t i-> a?(t) = (asift), . . . , ar„(t)) 

is an integral curve for a vector field i* 1 if for each t, x'(t) = F(x(t)). Thus x(t) = 
(xi(t), . . . ,x n (t)) is just the solution to the system of first order differential equa- 
tions 



dxj (t) 



Fj(x(t)). 



Under appropriate conditions we may find an integral flow for the vector field, i.e, 
a family of mapping at : M —* M such that for each x £ M, x i— > Uf/ is an 
integral curve, and furthermore cr t+t i = a t o a t , , ao = I. This in turn determines 
a one-parameter group of algebraic automorphisms at of the algebra T>, where 
atf(x) = f{o~-tx). Using power series one finds a simple relationship between the 
derivation Df and the automorphism group at : 

D F (f) = lim 

h— >o h 

±n 

a t (f) = e tD -f = Y / ^D F (f) 

Turning to physics, let us consider a single oscillating particle with one degree 
of freedom. The Newtonian equation of motion is given by F = ma. Let us assume 
that the force F only depends upon the position q. Thus we are considering the 
second order equation 

d 2 Q 

F(q(tj) = 

Since we have restricted to one spatial dimension, F is automatically conservative, 
i.e., F(q) = —V'(q) for some function V, namely V(q) — — J F(q)dq. 

We begin by replacing Newton's equation by two first order equations. Although 
there are many ways this can be done (e.g. one can let dq/dt — v, and dv/dt = F/m) 
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Hamilton found a particularly elegant way of doing this. Specifically we use the 
variables q and p = mv. The corresponding equations are 

dq dH 
dt dp 

<"» I - -f 

where 

H(q,p) = ^-+V(q). 

We may regard the solutions curves j(t) = (q(t),p(t)) as as the integral curves of 
the symplectic gradient vector held 

1TT dH 9 dH d 
sgradff = -5- -5- - -5-^-- 
op dq dq op 

in the phase space M2 = K 2 of variables (q,p). This quantity is the "symplectic" 
analogue of the usual gradient 

,„ dH d dH d 
gradtf = -£--5- + 

dq dq dp dq 

but it is not necessary to go into details. 

In fact an arbitrary function a(q,p) on Mi determines a vector field 

da d da d 
sgrada = — — - ^--5-, 
dp dq dq dp 

and thus corresponding flow 

a a t : M 2 -» Af a , 

where 7(f) = c"(a;o )(q(t), p(t)) is a solution of the "Hamiltonian system" 

dp 

dp da 
dt dq 

The Poisson brackets of two functions a and 6 is defined by 
{a,6} = (sgrad a) (6) = -----. 
In particular, we note that if {a, 6} = 0, then letting (q(t),p(t)) be an integral curve 

of dm, 

d& 96 dq 96 dp 96 da 96 9a 

— = 1-| tL = = 

dt dq dt dp dt dq dp dp dq 
i.e., the function 6 is constant on the orbits of a. Since {a, a} = 0, we see that a is 
constant on its own integral curves. 

Perhaps the most striking attribute of the phase space parametrization is that 
the area pq of a rectangle has the dimensions 

[momentum] x [distance] = AiC 2 l 1 

i.e., area is an action variable. This link between the notion of area (or more 
precisely the area two-form ft = dp A dq) and a physical parameter is one of the 
most powerful features of the Hamiltonian theory. We say that a change of variable 
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Q(q,p), P(q,p) is canonical if it preserves the area form, i.e., we have that the 
Jacobian is 1: 

_ d{Q,P) _ dQdP dQdP 
d(q,p) dq dp dp dq 
If that is the case, then the dynamical system Q(t) = Q(q(t),p(t)), P(t) = P(q(t),p(t)) 
is also Hamiltonian, i.e., it has the form 

dQ dH 
~dt ~ ~d~Q 
dP dH 
~dt ~ 

where H(Q, P) = H(q(Q, P),p(Q, P)). To see this we note that 
dQ dQ dq dQ dp 

dt dq dt dp dt 

dQ_dH_ dQ dH 
dq dp dp dq 

dQ/^dlldQ m^\_dQ/'dH_dQ dH dP\ 
dq \dQ dp + dP dp ) ~ dp \ dQ dq + dP dq ) 
dHd(Q,P) _ dH 
dP d(q,p) ~ ~dP' 

and a similar calculation may be used for the second equation. It is also easy to 
see that a canonical change of variables will leave the Poisson brackets of functions 
invariant. Given a Hamiltonian co-ordinate system (Q,P), we say that Q and P 
are conjugate variables. 

Let us assume that our system is oscillatory, i.e., all of the solution curves 
(q(t),p(t)) are closed. We may assume that (q(0),p(0)) = (q(T),p(T)), where T 
depends upon the orbit. Our goal is to find the "simplest Hamiltonian co-ordinate 
system" (6, 1) by using a canonical transformation, with the properties 

• H(6,I) = H(I), i.e., H doesn't depend upon 6, and 

• 9 increases by 2ir on each closed orbit. 

Given such a system, we have 

dJ <91f 
dt ~ ~1W ~ 

and thus I and H(I) are constant on each orbit 7. It follows that 

_ d6 _ dH 

UJ ^dt^~dT 

is also constant on each orbit 7, i.e., ui = = ^(7), and 9{t) = ut + C for some 
constant C. We may assume C = 0, and from the second property, uj = 2-k/T. 

The canonical transformation from (q,p) to (9, 1) transforms the area A enclosed 
by an orbit j(t) = (q(t),p(t)) to the area R of the rectangle < 9 < 2n, < I < 
1(7). Since the purported transformation is canonical, we have 

j> pdq = A = R = 271-/(7), 

J -y 

where 7 is the unique integral curve that passes through (q,p). Thus assuming that 
we can find a canonical transformation with the desired properties, I is an action 
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variable. For the proof that the transformation exists (and a formula for 9) we 
recommend or [H]. We define 

I = hi pdq 

to be the action variable and 9 the angle variable. As one would expect, 9 is 
multivalued since it increases by 2ir on each circuit of an orbit. 

The action, angle variables enable us to use Fourier series in our analysis of a 
periodic motion. Given an arbitrary function a on M2 and using the action-angle 
variables, the function a(I, 9) will have period 2tt in 9. It thus has a Fourier series 



(12) a(I,9)=J2< n > 



where a(n) is a function of/. Substituting the solution of the Hamiltonian equations, 
we obtain A function of time: 

o(t) = a(n)e muJt 

where a(n) is constant on the orbit. 

5. The commutation relation 

We will identify the energy variables H and E. There is a close parallel between 
the classical formula 

_ dH _ dE 

and Bohr's difference formula 

Em Ejm—h 

n 

To make this more explicit, let us "discretize" the action variable I by letting 
/ = mh and A^I — kh. Then according to Bohr's correspondence principle, if 

k <C m, 

AkE A^E E m — E m _k i i dE 

-r—F = ~TT~ = 1 = k w m,m-fe ~ k kuj = —. 

Afe/ kti n ol 

It thus appears that Bohr's correspondence principle is embodied in the fact that 
the finite difference with respect to the discrete action variable / approximates the 
differential quotient with respect to the continuous action variable /. For this reason 
it seems justifiable to apply this to arbitrary quantum variables and their classical 
analogues. We will use the symbolism 

(13) Aj^dl 

(see The difference operator will be applied to a matrix variable by the 

formula 

(A^A) (to, n) = A(m, n) — A(m — k,n — k). 
In his calculation, Heisenberg concentrated upon the Fourier coefficient functions 
a{£) of a function a on the phase space M2 and the scalar matrix coefficients A(m, n) 
of a matrix A in the "expansions" 

a = ^a{l)e Uu3t 



A = [A{m,n)e iu>m ' nt ] 
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If a is the classical function variable "reduction" of the matrix variable A, then for 
£ = to — n -C to the coefficient A(m, n) of e luJm - nt should approximate the coefficient 
a(£) of the overtone (e iujt ) e . We will write A a, and A(m, n) a(£). We wish to 
show that if A ~~> a and B ~^b, then [A, B] {a, b} . 
If j, k <C to then 

1 c^tz 

A(m, to - j) ~* a(j) = -rj -1 (for j ^ 0) 

(A fc A)(m,TO-j) -» kh—(j). 

The equality is seen if one takes the derivative of Ijl2(l with respect to 6. The second 
reduction is a formal consequence of fT3|l . 

Let us suppose that we are given matrices A and -B and functions a and 6 with 
A a and B -w &. li £ = m — n<CTO 

(AS - BA)(m,n) 

= A(m,m — j)B(m — j,m — j — k) 

j+k=£ 

— B{m, to — k) A(m — k, m — k — j) 

j+k=i 

= [^4(to, to — j) — A{m — k,m — j — k)\B{m — j,m — j — k) 

j+k=t 

— A(m — k.m — j — k)[B(m, to — k) — B(m — j, to — j — k)] 

j+k=e 

= (A k A)(m,m- j)B(m- j,m- j - k) 

j+k=e 

—A(m — k, to — k — j)(AjB)(m, to — k) 

j+k=t 



da db da db 
~dl~dd ~ ~dd~dl 

{a,b} (£). 



(0 



(see © — we note that f| (0) = |f (0) = 0). 

As Heisenberg points out in a footnote, this calculation is problematical even 
as a heuristic guide. Although n — rn = £ = j + k is assumed "relatively small" 
with respect to to and n, we are summing over arbitrary j, k with j + k = £. 
Heisenberg explains this away by pointing out that if j is large it will follow that k 
is large (usually with opposite sign) and vice versa, and thus all the matrix positions 
(to, to — j), (to— j, m — j — k), (to, to — j), and (to — k, m — k—j) will be distant from 
the diagonal. He states that the corresponding matrix elements must be negligible 
"since they correspond to high harmonics in the classical theory" . 
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We conclude 

[A, B]~*^ {a, b} 

Since 

, , dp dq dp dq ^ 
dp dq dq dp 

if we let P and Q be the quantized momentum matrices, i.e. P ~-> p and Q q, 
we are led to postulate the commutation rule 

[P,Q] = ?/. 
% 

This relation is the most essential algebraic ingredient of quantum mechanical 
computations. The reader may find early instances of these calculations in 
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